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ABSTRACT 

We investigate the distributions of the orbital period ratios of adjacent planets in high 
multiplicity Kepler systems (four or more planets) and low multiplicity systems (two 
planets). Modeling the low multiplicity sample as essentially equivalent to the high 
multiplicity sample, but with unobserved intermediate planets, we find some evidence 
for an excess of planet pairs between the 2:1 and 3:1 Mean Motion Resonances in the 
low multiplicity sample. This possible excess may be the result of strong dynamical 
interactions near these or other resonances or it may be a byproduct of other evolu- 
tionary events or processes such as planetary collisions. Three planet systems show 
a significant excess of planets near the 2:1 Mean Motion Resonance that is not as 
prominent in either of the other samples. This observation may imply a correlation 
between strong dynamical interactions and observed planet number — perhaps a rela- 
tionship between resonance pairs and the inclinations or orbital periods of additional 
planets. The period ratio distributions can also be used to identify targets to search 
for missing planets in the each of the samples, the presence or absence of which would 
have strong implications for planet formation and dynamical evolution models. 

Key words: celestial mechanics; methods: data analysis; techniques: photometric 



1 INTRODUCTION 

NASA's Kepler mission has iden tified several thou- 
sand candida t e ex oplanet systems l|Borucki et all l201ll ; 
iBatalha et ail 120121 ). A significant, and important, sub- 
set of thes e systems show mu ltiple candidates orbiting the 
same star jSteffen et alj|2010h . From these multiplanet sys- 
tems we gain important insights into the system archi- 
tectures, dyna mics, and the overa ll population of plan- 
etary systems |Lissauer et all 20 111: IFabrvckv et all 120121 : 



Ford et all l2011bl: iFang and Margol l2012al ). For example. 



Latham et all (|201ll ) found that short period Jupiter-sized 



planets do not show additional transiting bod ies that have 
orbita l periods within a factor of a few tens. ISteffen et al.l 
|2012l ) subsequently showed that nontransiting planets are 
not found on orbits near low-order mean-motion resonance 
(MMR) with hot Jupiter candidates — an observation that 
can effectively eliminate some models for the formation of 
hot Jupiter systems. 

ISteffen et all (|2012h made these statements using a 
bulk search for transit timing variations (TTVs) which 
are deviations from a constant period that result from 
gravitationa l interactio n amo n g the various planets within 
a system (lAgol et al.l 120051 : iHolman and Murravl [2005; 
iMiralda-Escudel |2002| ). TTVs have been a boon to exo- 
planet science from the Kepler mission as TTV methods 



have dynamically confirmed most of the known Kepler plan- 
ets dHclman et all l20ld: IFabrvckv et all [201ll ; iFord et all 
l2011al : ISteffen et all |2012|. |2013| ). including nontransiting 



planets (|Ballard et al.l 201ll ). as well as with sev eral uncon- 



firmed systems showing evidence for this signal (|Ford et al.l 
120121 : ISteffen et ai]|2012l ) . 

The properties of the Kepler multiplanet systems sug- 
gest that most planets observable by Kepler live in mult i- 
planet systems |Lissauer et all EiOllI ; IFabrvckv et al.l 120121 ). 
However, the census of planets in the Kepler catalog is 
incomplete because mutual orbital inclinations among the 
planets in the systems mean that some planets simply do 
not transit the host star, and therefore are n ot detected by 
the K epler photometry alone. Nevertheless, iLissauer et alj 
l|201ll ) showed that these mutual orbital inclinations in mul- 
tiplanet systems are only a few degrees, and that plane- 
tary sytems tend to be very flat, as would be expected from 
planet formation within a gasous protoplanetary disk. Thus, 
some systems that have a low multiplicity of observed plan- 
ets are likely to host nearly coplanar, but unseen sibling 
planets. 

Here we use the orbital period distribution of high mul- 
tiplicity Kepler systems as a means to study lower multi- 
plicity systems. From this study we can make some state- 
ments about where we may expect to find additional non- 
transiting planets. The absence of these additional plan- 
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ets would have very significant ramifications regarding the 
formation and dynamical evolution of the systems in ques- 
tion. Some models of planet formati on suggest that plane- 
tary s ystems are dynamically packed fLaskai 2000l; Lissauerl 
1 19951 ; Barnes and Ravmondlfiooi ; iFang and Margotll2012bf ) 
such that stable orbits in a planetary system are likely to 
be occupied. Studies of the architectures and dynamics of 
Kepler systems indicate that virtually all of the Kepler 
multiplanet systems are likely stable and t hat the ratios 
of orbital periods are gen erally not large (jLissauer et al.l 
l201ll ; iFabrvckv et al.l |2012| ) . These observations are consis- 
tent with the predictions of dynamical packing. However, 
we know from the population of hot Jupiter planets that 
not all systems share a common dynamical history; and 
that that relatively small population has motivated new 
ideas for models of planet formation and dynamical evo- 
lution. The identification of any new population of systems 
with a unique history would be comparably valuable — such 
as the lo ng-predicted but newly discovered c ircumbinary 
syste ms (|Dovle et all l201ll ; IWelsh et all |2012| ; lOrosz et all 
2012). Likewise, the absence of a detectable population of 
dynamically distinct systems also makes strong statements 
both about the formation and histories of planetary systems 
generally and about the very large sample of Kepler systems 
that show only a single planet candidate. 

In Section[2]we investigate some of the properties of the 
distribution of the period ratios of adjacent planets in sys- 
tems with four or more observed planet candidates. Section 
[3] uses the results from the high multiplicity sample to make 
and test predictions about what we should see in the low- 
multiplicity sample where only two planet candidates are 
seen in each system. Section [4] makes a similar comparison 
with the three-candidate systems. We investigate the newly 
released, preliminary planet candidate list that results from 
the analysis of Kepler data through quarter 12 (Q12) in 
secction [5] We make concluding remarks in Section [6] 



2 HIGH MULTIPLICITY SAMPLE 

Consider th e sam ple of candidate systems from 
iBatalha et alj (|2012T ) (hereafter B12) that contain four 
or more candidates. We will call t h is sam ple the "high 
multiplicity" systems. iLissauer" et all (|2012h shows that all 
multiple candidate systems have a high probability of being 
true planetary systems — a statement that is especially 
true as the multiplicity (i.e., the number of candidates on 
a single target) grows. Thus, we claim that the Kepler 
multiple candidate systems are a sufficiently pure sample 
of true planetary systems for our study and we will use 
the terms "planet" , "planet candidate", and "candidate" 
are used interchangeably. The number of high multiplic- 
ity systems from B12 is 36 with a total of 154 planets 
(averaging 4.3 planets per system) however we drop two 
systems from this sample, one where a planet has a negative 
period (meaning that only one transit had been seen) and 
a second where the system is u nstable according to the 
criterion given in iGladmanl (|l993h assuming Neptune mass 
planets. The final high multiplicity sample thus contains 
34 planetary systems with 146 planets, giving 114 adjacent 
planet pairs for study. 
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Figure 1. Histogram of period ratios for high multiplicity sample. 
The binning of this histogram is chosen to show some of the known 
features near the 3:2 and 2:1 MMRs. 



ILissauer et al.l I^Oll! - ) (hereafter Lll) showed that the 
typical mutual inclination of the orbits in multiplanet sys- 
tems is a few degrees; multiplanet systems are highly copla- 
nar. High multiplicity systems likely represent the low 
mutual-inclination side of that distribution and we use them 
here as a proxy for multiplanet systems generally. Figure [T] 
shows a histogram of the orbital period ratios of adjacent 
planets in this high multiplicity sample. In t his figure we can 
see so me of the features discussed in Lll and Fab rvckv et alj 
(|2012f) — specifically the excess of planets near the 3:2 MMR 
and the lack of planets just interior to the 2:1 MMR. Another 
feature of potential interest is the marginally significant peak 
near the 7:2 MMR. The 7:2 resonance is not normally impor- 
tant dynamically without some modest eccentricity in one 
or both of the planets in the pair. 

The shape of the log of the period ratio distribution 
is reasonably well approximated by a truncated Rayleigh 
distribution where the minimum value of the distribution 
is approximately 0.05 (corresponding to a period ratio of 
10 05 = 1.1 and the Rayleigh parameter is 0.23. This value 
of the Rayleigh parameter is found using maximum likeli- 
hood estimation on the truncated distribution. We do not 
here ascribe any physical significance to this parameteri- 
zation, only noting that it is generally consistent with the 
shape of the observed distribution for the high multiplicity 
sample. Orbital stability requirements will natually impose 
some truncation at smaller values as we have done — though 
the functional form of the distribution at small separations, 
our approximation notwithstanding, is certainly not a step 
function. This distribution tends to over estimate the num- 
ber of planets in the region roughly between the 2:1 and 4:1 
MMRs. Regardless, for this work neither of the tails of this 
distribution play a rigorous role and the overestimation of 
the number of systems at modest period ratios will tend to 
make later results more conservative. 

We expect that most of the planets that lie between the 
innermost and the outermost planet in the high multiplicity 
systems are accounted for. However, some planets will in- 
evitably be absent due to either inclination of the orbit or 
the smalln ess of the planet's t ransit signal (a good example is 
Kepler-36 (|Carter et al.ll2012l ) which had an undetected inte- 
rior planet that was missed because of its small size and large 
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TTVs). Missing planets will extend the tail of the period ra- 
tio distribution to larger separations. These large gaps in the 
observed planetary systems are potentially very interesting 
since they could contain an additional planet, they could be 
caused by some physical property of the protoplanetary or 
planetesimal disk (e.g. the presence of a phase transition in 
the gas), or they could be a sign of some dynamical event 
or events in the history of the system. 

For observational purposes a simple cut in period ratio 
would suffice to identify good candidate systems to search 
for the missing intermediate planets (e.g. all planets where 
adjacent period ratios are larger than 5:1 or 6:1). We do, 
however, wish to make some estimate of the probability that 
the tail of the distribution is from missing planets rather 
than simply the natural tail of the distribution. To do this 
we iteratively remove data from the long tail of the data 
and refit a Rayleigh distribution with both tails truncated. 
If the data are indeed drawn from this distribution, then 
the resulting fit to the Rayleigh parameter should be largely 
independent of the location where the cuts were made. Out- 
liers, on the other hand, would show up as a decrease in the 
value of the Rayleigh parameter as the outlier distribution 
was systematically removed from the analysis — lessening the 
effects of the tail. In fact, the value of the Rayleigh parame- 
ter does drop from 0.23 to 0.21 as a result of this truncation. 
This difference, if measured with high confidence, can cause 
a nontrivial change to the length of the tail and the height 
and location of the peak — especially since the period ratio 
is the exponential of this quantity. 

To estimate the statistical significance of this decline in 
the Rayleigh parameter, we conducted a Monte Carlo sim- 
ulation where we drew multiple 114- planet samples using a 
Rayleigh distribution with the large-ratio tail truncated at 
a variety of values (the small-ratio tail is consistently trun- 
cated at a period ratio of 1.1). The variance in these samples 
gives us an estimate of the expected statistical fluctuations, 
which we find to be on the order of 0.1. This estimate indi- 
cates that the widely separated planets cannot be excluded 
as outliers with high confidence (the decline in the Rayleigh 
parameter is only ~ 1.5a). With additional data on these 
systems, a more sophisticated analysis, or a distribution de- 
rived by some other means one might be able to make more 
definitive statements about the few systems with large gaps 
in the high multiplicity sample. 

Regardless of the statistical significance of the largest 
gaps, the systems associated with them should prove inter- 
esting objects for study. Mass measurements of the planets 
in those systems (to see if there is any large change in planet 
mass near the gaps that might indicate where the formation 
material accumulated) and searches for additional planets, 
whether by TTVs or from Radial Velocity (RV) measure- 
ments, will yield important information for general planet 
formation models. The value of these searches is high regard- 
less of whether additional planets are present or are absent 
since either result has implications for theoretical models. 

We note, and show in Figure [2l that there is no ob- 
vious correlation between the size ratio of adjacent planets 
and the period ratio of adjacent planets in the multiple can- 
didate systems. If the largest gaps in the planetary systems 
corresponded to some important transition in the protoplan- 
etary disk, as opposed to a missing planet, one might expect 
that the inner and outer planet of the widely separated pair 
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Figure 2. Size ratio of adjacent planets as a function of the 
period ratio of the adjacent planets for the Kepler multiplanet 
systems. Points corresponding to the high multiplicity sample are 
highlighted with circles. We note the increased variance for the 
most closely spaced pairs. 

would have dissimilar sizes. This figure confirms the known 
obsevation that ex terior planets tend to be slightly larger 
l|Ciardi et al.ll2012T ). However, there also appears to be larger 
variance in the size ratio of adjacent planets in more closely 
spaced orbits than in the more widely separated orbits — 
both for systems with larger interior and with larger exte- 
rior planets. Some observation bias may be at play here, but 
the same mechanisms that produced the planets in Kepler- 
36 l|Carter et alj|2012l ) which have distinctly different den- 
sities but nearly the same orbital distance, may also explain 
this increase d variance. This in crease in variance was not 
observed in (|Ciardi et al.i r2012l due to the cuts in transit 
signal-to-noise ratio (SNR) that are applied therein. Those 
same cuts are not employed here since our primary concern 
is with period ratios instead of radius ratios and uncertain- 
ties in planet size do not have a significant effect on our 
results. Nevertheless, if we consider only those systems with 
a single transit SNR greater than 10 for all planets in each 
system, for example, then only ~ 10% of the systems are 
removed (only one of the high multiplicity systems is re- 
moved). The low SNR systems do not have any observable 
correlation with period ratio and our results would not be 
materially affected. 



3 TWO-PLANET SYSTEMS 

Now we turn to low multiplicity systems, the 244 systems 
that have only two transiting planetfl A reasonable hy- 
pothesis about these systems is that they are fundamentally 
the same as the high multiplicity systems but have slightly 
larger mutual inclinations. If this were the case, then the low 
multiplicity systems should have essentially the same num- 
ber of planets in them as the high multiplicity sample, but 
with two or more planets that are not seen simply because 
they do not transit. Such systems would still be essentially 
coplanar as the inclinations need not be large to prevent a 
planet from transiting. 

1 There are 245 systems listed in B12, but one has a planet with 
a negative (unmeasured) period and is therefore excluded. 
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In principle, it is no more likely that a planet would 
be missed because of its small size in the low multiplic- 
ity sample than it is for the high multiplicity sample. It 
is conceivable that many of the low multiplicity sample are 
in fact high multiplicity systems with all planets transiting 
but, due to some detection bias that limits the minimum de- 
tectable planet size, some planets are missed (e.g., perhaps 
low multiplicity systems are noisier or more dim). Several 
facts, however, do not support this hypothesis. For exam- 
ple, the median size of the planets in each sample are very 
similar at 2.1 and 2.2 R® for the high and low multi- 
plicity samples respectively. In addition, the average of the 
first six quarters of Combined Differential Photometric Pre- 
cision (CDPP) of the high multiplicity sample is slightly 
worse, with a median of 183 ppm, than the low multiplicity 
sample, with a median of 128 ppm. Thus, planets that were 
missed because of noise would more likely have occurred in 
the high multiplicity sample. There remains, however, the 
caveat that the Kepler planet candidate catalog has not yet 
been uniformly vetted and some more subtle bias could af- 
fect the interpretation of these results. 

If the low and high multiplicity systems are fundamen- 
tally the same, then we should be able to reproduce the pe- 
riod ratio distribution of the low multiplicity systems from 
the high multiplicity distribution. We attempt to do this by 
drawing a sample of planet pairs where the period ratio is 
either drawn from the best fitting truncated Rayleigh dis- 
tribution from above, or is generated from two or three re- 
alizations of the that distribution being multiplied together 
(added in log space). For example, if a low multiplicity were 
missing the second planet, then the observed period ratio 
for the first and third planets would be the product of the 
period ratio of the first and second planets with the ratio 
of the second and third planets — where both of the ratios 
would be drawn from the same distribution. The relative 
numbers of realizations in this sample with one or two miss- 
ing planets would be given by the relative probability of a 
planet to transit ~ (ai n /a out ) ~ 1/1.5 where (ai n /a out ) is 
the mean semimajor axis ratio of the underlying distribu- 
tion (here we ignore the small contribution that would come 
from five or six planet systems where three or four planets 
would be missing). 

We thus construct an approximation to the low multi- 
plicity sample using 10000 realizations from the truncated 
Rayleigh distribution, 6000 from the product of two realiza- 
tions, and 4000 from the product of three realizations. As 
shown in Figure the bulk properties of the low multiplic- 
ity distribution can be reproduced in this manner. However, 
there are a few notable discrepancies. The peak is slightly 
underestimated and the wider separations (between 4:1 and 
8:1) are slightly overestimated — as would be expected from 
the stated over estimate of modest period ratios from the 
underlying distribution. Both of these differences can be re- 
duced slightly by using a more narrow model distribution 
such as what one would get if the largest values in the high 
multiplicity distribution were considered to be part of an 
outlier tail. We note that the period ratio distribution of 
the inner two planets of the high multiplicity sample is not 
similar to the low multiplicity sample — indicating that miss- 
ing planets are needed to explain the overall shape of the 
low multiplicity distribution. A similar test, drawing directly 
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Figure 3. Histogram of period ratios for low multiplicity sam- 
ple as well as two simulated distributions constructed using the 
Rayleigh distribution that is estimated from the high multiplicity 
sample (as described in the text). The solid line corresponds to 
a Rayleigh parameter of 0.23 and the dashed line has a Rayleigh 
parameter of 0.21 (which is what one gets when dropping the 
widely spaced planets — those beyond a ratio of ~ 6 — from the 
high multiplicity sample). 



from the empirical distribution of the high multiplicity sys- 
tems yields virtually identical results. 

One feature of note in the low multiplicity sample that 
cannot be readily explained using our simple model is a 
slight excess of pairs in the region between the 2:1 and 3:1 
MMRs. In general, it is difficult to produce this excess sim- 
ply by the method described above — that is, by assuming 
missing planets that are drawn from the high multiplicity 
distribution. If we assume that many systems show first and 
third planets with the second planet missing, then the close- 
proximity tail of the resulting distribution is far too low. On 
the other hand, we assume that most of the low multiplicity 
distribution is made of true adjacent pairs, then the widely- 
separated tail of the resulting distribution becomes too low. 
Even if we ignore the effects on the two tails, one must still 
rely on statistical fluctuations to produce the observed ex- 
cess. 

To estimate the significance of this excess under the 
missing planet model we conduct a Monte Carlo test. To 
be conservative, we assume that the probability of seeing 
the first two planets in a system is equal to missing a sec- 
ond planet while seeing a third — an assumption that fa- 
vors an excess of longer-period pairs, though not by a large 
amount. Of the 244 low-multiplicity Kepler candidate pairs, 
72 (about 30%) have period ratios between 2.1 and 2.9. We 
drew 10,000 samples of 244 pairs (with 122 that assume 
no missing planet and 122 that assume the second of three 
planets is missing) drawn from the truncated Rayleigh dis- 
tribution shown in Figure[T] — where the period ratio must be 
greater than 1.1 and the Rayleigh parameter is 0.23. From 
these samples only 70 of 10,000 had 72 or more planet pairs 
in this same region, or less than 1%. Thus, there is strong 
(but not necessarily definitive) evidence for some additional 
population of planetary systems with period ratios in this 
region. 

If one changes the neighborhood of interest (instead of 
considering only planet pairs with period ratios between 2.1 
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Figure 4. The probability, estimated by Monte Carlo simula- 
tion, of statistical fluctuations producing the observed excess as 
a function of the starting and ending locations of the region of 
interest. The low probability region in the upper left implies that 
statistical fluctuations are relatively unlikely to produce the ob- 
served excess when considering large regions between the 2:1 and 
3:1 MMRs. The peaks along the diagonal occur because there are 
fewer observed pairs with these period ratios and one can more 
easily produce the observed excesses when considering these nar- 
row regions. 

and 2.9), then the results of the Monte Carlo simulation give 
different, though not unexpected results. In particular, if on 
considers a narrow neighborhood near the 2:1 MMR, then 
the probability of seeing the corresponding excess can rise to 
just under 30%. Or, if one considers a narrow neighborhood 
near the 3:1 MMR, then the probability of seeing the cor- 
responding excess rises to 20%. This effect arises primarily 
because there are relatively fewer candidates observed in the 
two-planet systems in these narrow windows. Because of this 
effect, we believe that averaging over the large neighborhood 
that we originally state is appropriate for this test. 

If the underlying Rayleigh distribution is indeed more 
narrow due to missing planets in the high multiplicity sam- 
ple, then the probability of producing the observed excess 
decreases further. Therefore, the ~ 0.01 false alarm proba- 
bility should be a robust upper bound. We show, for com- 
pleteness, in Figure [4] the probability of statistical fluctu- 
ations producing the excess observed as a function of the 
values of the start and end of the neighborhood where the 
excess is measured. For example, the probability of randomly 
producing the observed number of planets with period ratios 
between 2.2 and 2.4 is 0.25 while the probability of randomly 
producing the number of observed planets with period ratios 
between 2.4 and 2.8 is 0.05. Using our model, the expected 
number of systems in the 2.1 to 2.9 range is approximately 
55 — as opposed to the 72 observed pairs, implying that per- 
haps of order 20% of these systems are anomalous. 

A few explanations of this excess are both straightfor- 
ward and can be tested with further analysis and/or more 
data. One possibility is that the observed excess is indeed 



a statistical fluctuation. More data from Kepler (which we 
conduct below) or from other exoplanet surveys will test this 
hypothesis. A second possibility is that there are, in fact, 
missing planets in the two planet systems such that the first 
and third planets transit and have a period ratio between 2 
and 3. While this is expected to be the case some fraction 
of the time, a challenge for this scenario being the primary 
explanation for the excess is that it implies that third plan- 
ets are easier to detect than second planets and that the 
observed low-multiplicity systems are, in fact, more densly 
packed than the high multiplicity systems. A third explana- 
tion, which we call the "dynamical" hypothesis, is that the 
dynamics of the 2:1, 3:1, 5:2 or other high-order MMRs in 
this region are strong and planets have become trapped in 
their orbits due to the presence of these MMRs. A fourth 
explanation, which we call the "kinematic" hypothesis, is 
that at some point during the formation of the system the 
orbits of two planets overlapped which eventually led to the 
collision of those planets. The consequence of such a col- 
lision would be a single remaining planet, on an orbit that 
was intermediate to the initial pair. Finally, the observed ex- 
cess could be due to some other process that occurred either 
during the formation or subsequent dynamical evolution of 
the system. Regardless, further scrutiny of these sytems is 
justified. 

Collisions, the kinematic hypothesis, occasionally sur- 
faces as a me ans to explain o b served features in planetary 
systems (e.g., Ijohansen et al.l ||2012| )). Given the observed 
tightly packed planets in the Kepler high multiplicity sam- 
ple, it is not unreasonable to believe that some may have 
gone unstable — with subsequent orbit crossings and colli- 
sions. A simple demonstration of the effect can be achieved 
through (yet another) Monte Carlo simulation. Here, we 
draw a pair of planets from the same Rayleigh distribution 
as above and assign each a random orientation and an ec- 
centricity drawn from a second Rayleigh distribution with 
Rayleigh parameter 0.05 (the functional form of the eccen- 
tricity distribution is largely irrelevant for our purposes as 
we only need the eccentricities to provide overlapping or- 
bits). If the orbits of the two planets overlap, then we ran- 
domly choose one of the two intersection points and kine- 
matically derive the outcome of a completely inelastic colli- 
sion assuming conse rvation of ang ular momentum — similar 
to what was done in lLaskaij (|200Ch . 

The results of this Monte Carlo simulation is shown 
in Figures [5] and [6] Figure [5] shows the initial distribution 
of period ratios and the period ratios of the post-collision 
systems. Here we only had collisions between the outer two 
planets, but if it had been the inner two planets that collided 
the results would be essentially the same since it is the ra- 
tios of the periods that matter. We see that the distribution 
shifts to the left. While the shift in this toy model does not 
coincide with the location of the observed excess, the amount 
that the distribution shifts will depend upon the allowed ini- 
tial conditions for the crossing planets. If eccentricities tend 
to be small, then only systems that are more tightly packed 
would produce collisions with the result that the peak would 
shift by a smaller amount, nearer the observed excess. This 
would be a reasonable constraint for the initial conditions 
since the more tightly packed systems would likely be the 
systems closest to the limits of stability where crossing orbits 
would be generated in the first place. 
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Figure 5. Histogram of the starting period ratios (solid bars) and 
ending period ratios (line) if one assumes simplified, kinematic 
collisions between two planets where the collisions are completely 
inelastic and conserve angular momentum. The shift of the peak 
to the right (larger separations) may be an underlying explana- 
tion for the observed excess of planets in low multiplicity systems 
between the 2:1 and 3:1 MMRs. 
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Figure 6. Histogram of the starting eccentricities (solid bars) and 
ending eccentricities (line) of the planets if one assumes simplified, 
kinematic collisions between two planets where the collisions are 
completely inelastic and conserve angular momentum. The loss of 
eccentricity in collisional systems may be an observable that could 
distinguish collisions from strong dynamical interactions as the 
explanation of the observed excess of planets in low multiplicity 
systems between the 2:1 and 3:1 MMRs. 



A second consequence of collisions is a reduction in ec- 
centricity as shown in Figure [6] For this simulation the me- 
dian (and mean) of the eccentricity distribution is reduced 
by 35%. This reduction in the eccentricity distribution may 
provide a means to distinguish the dynamical hypothesis 
of strong, high-order MMRs and the kinematic hypothesis, 
of crossing orbits and planetary collisions, for the origin of 
these planet pairs. If strong dynamical interactions via high- 
order MMR are responsible, then large eccentricity values 
would be expected since high order resonances are otherwise 
suppressed by multiple factors of the eccentricity. However, 
if collisions are the cause, then the eccentricities of the re- 
sulting systems may be small by comparison since collisions 
suppress eccentricity. 



4 THREE-PLANET SYSTEMS 

While not the primary focus of our study, we now make 
some comments on the distribution of period ratios in the 
observed three-planet systems. There are 83 three planet 
systems with 166 associated planet pairs that satisfy the sta- 
bility criterion. We deliberately excluded three-planet sys- 
tems from the preceding study so that distinctions between 
the high multiplicity and low multiplicity systems in the 
B12 catalog could be more clearly identified. Nevertheless, 
the three planet systems also show some interesting features 
that merit further investigation. 

Figure [7] shows the period ratio distribution for systems 
with three planet candidates as well as the predicted dis- 
tribution using a method essentially the same as what was 
done with the two-planet systems but only assuming a single 
missing planet. The most striking feature of the three planet 
distribution is the large peak surrounding the 2:1 MMR — a 
peak that is not nearly as prominent in the high multiplicity 
sample of Figure[T] The peak just exterior to the 2:1 and and 
the small trough that just interior to it (the trough is more 
readily visible in the high mu ltiplicity sample) has been the 
subject of additional s t udy (iBatygin and Morbidellil 120131 : 
iLithwick and Wull2012l : iPetrovich et al.ll2012l ) and may be 
generated naturally during the planet formation process 
or during the subsequent dynamical evolution in the pres- 
ence of a dissipative force. In the three-planet systems, the 
peak near the 2:1 appears to be a true excess and may be 
caused by orbital migration after the planets have formed 
i|Thommesll2005l ; IZhou et al.| [2005). A smaller excess is also 
visible near the 3:1 MMR that has no counterpart of com- 
parable size in the high multiplicity sample. 

Since roughly 25% of the three planet systems are near 
the 2:1 MMR, several appear to be near 4:2:1 resonance 
chains or three-body, Laplace- type resonances. The observed 
number of 5 triples where the period ratios of all planets 
are within 5% of 4:2:1 is consistent with a random selection 
from the observed three planet distribution where one would 
expect 5.2 triples. Nevertheless, these near-chains contribute 
materially to the peak near the 2:1 MMR. It is not clear 
whether the near 4:2:1 chains are a consequence of or the 
cause of the excess near 2:1. Investigating the dynamical 
properties of these systems will be valuable for identifying 
their histories. 

The presence of the excess planets near the 2:1 MMR in 
the three-planet systems and the likely excess of planets be- 
tween the 2:1 and 3:1 MMRs in the two-planet systems may 
indicate that dynamical interactions play a strong role in de- 
termining the number of planets in a system — or at least the 
number of observed planets. If the orbits in these systems 
have modest eccentricities, then the dynamical interactions 
could excite the inclinations of nearby planets and reduce 
the number of planets that transit. Or, strong dynamical 
interactions may render nearby orbits unstable or preclude 
planets from forming or migrating into such orbits (though 
there is no compelling evidence in these data that the third 
planets in systems with a near-resonant pair have unusually 
wide separations). A detailed exploration of these possibili- 
ties lies beyond the scope of this work. As with the features 
of the two-planet systems, more data and specialized anal- 
ysis should produce important results that lead to a better 
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Figure 7. Histogram of period ratios for three-planet systems as 
well as the simulated distributions constructed using the Rayleigh 
distribution that is estimated from the high multiplicity sample 
(as described in the text). The large excess of planets near the 
2:1 MMR is not seen in the high multiplicity sample and is more 
localized than the possible excess seen in the two-planet systems. 



understanding of the histories of planetary systems gener- 
ally. 



5 RESULTS FROM PRELIMINARY QUARTER 
12 CANDIDATES 

In early January 2013 the Kepler mission released its cur- 
rent list of KOIs found using data through quarter 12 of 
operations (Q12). The public list is preliminary and some 
changes to it are expected. Nevertheless, an analysis of this 
preliminary list is valuable as a means to see if the possible 
anomalous features in the period ratio distributions seen in 
the B12 catalog remain when more data are included in the 
analysis. We downloaded the KOI list from the NASA Exo- 
planet Science Institute on January 8, 2013 and conducted 
tests with those data similar to those done for the B12 cat- 
alog. In the Q12 table there are 299 two-candidate systems, 
111 three-candidate, and 55 high multiplicity systems (once 
the stability criterion is invoked). 

The basic conclusions from the B12 catalog remain un- 
changed. That is, the overall features of the high multiplicity 
sample remain the same — the location of the peak, the ex- 
tent of the tail, and the excess near the 3:2 MMR. The slight 
excess between the 2:1 and 3:1 MMR for the low multiplic- 
ity sample is still present. While its significance diminishes 
somewhat (to about 1.5% over the period ratio interval from 
2.1 to 2.9), it does so in an intriguing manner described be- 
low. The large excess near the 2:1 MMR in the three-planet 
systems remains prominent while the small excess just inte- 
rior to the 3:1 MMR grows somewhat. Figures of the period 
ratio distributions from Q12 are not shown as they are not 
markedly different from the distributions derived from the 
B12 catalog. 

For the low multiplicity sample, the number of systems 
increased from 244 to 299. However, there are 90 new sys- 
tems overall. The difference arises because of the discovery 
of new planet candidates in 33 of the systems from B12 and 
two systems that do not appear in the new list. The dis- 
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Figure 8. A comparison of the initial period ratio of the two- 
candidate systems to the final period ratios of the three-candidate 
systems to which they were promoted via the identification of a 
third planet candidate. The final period ratios for a given sys- 
tem are joined by a solid vertical line. The dashed vertical and 
horizontal lines mark the 2:1 and 3:1 MMRs in both the initial 
and final samples respectively. The slanted, dashed line indicates 
the same initial and final period ratio. Thus, if a new candidate is 
discovered interior to or exterior to the initial two-candidate pair, 
then one of the two ends of the corresponding solid vertical line 
will be on this slanted line. If the solid vertical line lies entirely 
below the slanted line, then an intermediate planet was discov- 
ered between the original pair. No vertical lines lie entirely above 
the slanted line (which would otherwise indicate that one of the 
inital periods was an alias of the true period). Note that several of 
the initial pairs with large period ratios resulted in triple systems 
where one or more period ratios are near the 2:1 or 3:1 MMR. A 
few of the new KOIs are seen to have very large separations from 
the initial pair. 

covery of new candidates is not surprising and is consistent 
with the overall observation that the period ratio distribu- 
tion for low-multiplicity systems can be largely modeled in 
the manner described above. Seven of the low multiplicity 
systems added two or more planets — joining the high mul- 
tiplicity sample. Of these, five showed either additional in- 
terior or exterior planet candidates. The remaining two sys- 
tems added planets between the two known candidates and 
in one of those cases the period ratio of the original pair was 
2.22 — removing one low multiplicity system from the region 
of the observed excess. 

Many more low multiplicity systems (26 of them) added 
a single new candidate — joining the three-planet sample. Of 
these, a significant fraction were systems with period ratios 
between the 2:1 and 3:1 MMR. Most pairs simply added a 
new planet either interior to or exterior to the known pair; 
in some cases the third planet is very widely separated. Sev- 
eral two-planet systems (most with period ratios larger than 
3:1) added intermediate planets and a significant fraction 
of those additional planets generated period ratios in the 
anomalous region near or between the 2:1 and 3:1 MMRs. 
Figure [8] shows the inital period ratios of the two-candidate 
systems that became three-candidate systems and the two 
resulting period ratios of the three-candidate systems. One 
can see in that figure the increased number of new period 
ratios that are in the anomalous region. 
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Figure 9. The period ratio distribution for the combined two- 
candidate and three-candidate systems using the Q12 candidate 
list. Two significant peaks are visible near the 2:1 and 3:1 MMRs 
that are not nearly as prominent in the high multiplicity distri- 
bution in Figure [TJ The peak near the 3:2 MMR is visible in all 
samples. 

The result of the additional planets found in the low 
multiplicity systems from B12 is a decrease in the number 
of systems just inside the 5:2 MMR (and hence a decrease in 
the significance of the observed excess between the 2:1 and 
3:1 MMRs) but a marked increase in the fraction of three 
planet systems near the 2:1 and 3:1 MMRs. These facts sug- 
gest that the possible excess in the low multiplicity sample 
between the 2:1 and 3:1 MMR and the observed excess near 
these MMRs in the three-candidate systems may originate 
from the same evolutionary processes. To illustrate this, we 
show in Figure[9]a combined period ratio distribution for the 
Q12 candidates — combining the three-candidate and two- 
candidate systems. The large peaks near the 2:1 MMR and 
the 3:1 MMR are not present in the high multiplicity sam- 
ple (Figure [TJ. Further study of these systems should shed 
light on their origin and may offer definitive evidence of a 
population of exoplanet systems with a unique history. 

A Monte Carlo simulation of the combined two and 
three planet systems using the period ratios of the high mul- 
tiplicity systems shows that the observed excess near the 2:1 
and 3:1 MMRs are very unlikely to be caused by statistical 
fluctuations. Figure \W\ is a contour plot of the probability 
of statistical fluctuations producing the observed number of 
systems in a neighborhood as a function of the beginning 
and ending values that define the neighborhood. The re- 
gion where the probability is high corresponds to the region 
where some two planet systems were removed due to the ob- 
servation of additional planet candidates in those systems. 
Neighborhoods near the 2:1 and 3:1 MMRs have a very low 
probability that they are statistical in nature. 



6 DISCUSSION 

In this paper we explore the differences between the observed 
period ratio distributions for systems with high multiplicity 
(four or more planet candidates) and low multiplicity (with 
two planet ca n didate s) in the Kepler catalog published by 
iBatalha et all |2012[ ). If we make the straightforward as- 
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Figure 10. The probability, estimated by Monte Carlo simula- 
tion, of statistical fluctuations producing the observed excess in 
the combined two-planet and three-planet period ratio distribu- 
tion as a function of the starting and ending locations of the 
region of interest. The high probability region on the left side 
is due to the lower number of observed pairs in that region. The 
low probability near the 2:1 and 3:1 MMRs implies that statistical 
fluctuations are unlikely to produce the observed excesss. 

sumptions that 1) all of the planets in the high multiplicity 
systems, out to the outermost one, are accounted for in the 
Kepler data, and 2) that the high multiplicity systems are 
fundamentally the same as the low multiplicity systems save 
that two or more of the planets in the low multiplicity sys- 
tems do not transit, then we observe a few features in the 
low multiplicity sample that cannot easily be explained us- 
ing the high multiplicity sample. The study of the general 
features of the low multiplicity sample as predicted from 
the high multiplicity sample, in addition to the study of the 
anomalous features, has the potential to make important 
statements about the formation of planetary systems and 
their dynamical evolution. 

One avenue to follow, pertaining to the general prop- 
erties of the distributions, is that low multiplicity systems 
where the period ratio of adjacent planets is larger than 
a few (e.g., > 5) likely harbor as-yet undetected planets 
interleaved among the observed ones. The same may also 
be true for the high multiplicity systems. The presence or 
absence of these intermediate planets would support or re- 
fute the model of dynamically full planetary systems — or 
at least constrain the fraction of systems that are dynami- 
cally full. Large gaps in a planetary system, manifest in the 
tail of the period ratio distribution, likely indicate either 
important events or evolutionary processes in that system 
or important transitions in the structure or composition of 
the protoplanetary or planetesimal disks. For example, in 
our own solar system all of the period ratios of adjacent 
planets with one exception are well below 3:1 — consistent 
with the high multiplicity distribution. The exception is the 
Mars/Jupiter ratio at 6.3, near the possible end of the high 
multiplicity distribution and where an outlier distribution 
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would start to be observed. The Mars/ Jupiter neighborhood 
is extremely interesting as it straddles the ice line — a signif- 
icant phase transition. Moreover, if Ceres (or the asteroid 
belt generally) is included in the period ratio distribution, 
then the gap disappears entirely — replacing a period ratio of 
6.3 with a 2.4 and a 2.6. While the asteroids are not planets 
in the modern sense, they are a collection of mass located 
well within the region where our crude model predicts that 
planets commonly reside. Distant systems that are roughly 
similar to our own may have produced a planet there. 

A second avenue is to explore the evidence that a small 
population of planet pairs in the region between the 2:1 and 
3:1 MMRs may have some unique history or characteris- 
tic. We do not claim to know the cause of this excess, nor 
do we claim its existence with high confidence — since it is 
possible at about 1% of the time (2.5a) to produce it in 
the low multiplicity sample with our simple modeling ap- 
proach. We believe, however, that further investigation of 
the low-multiplicity and three-candidate systems where the 
planet orbital period ratios are between 2:1 and 3:1 is well 
motivated. If it turns out that this excess is simply due to 
undetected, intermediate planets, then that fact would im- 
ply that many planetary systems are more closely packed 
than the high multiplicity systems suggest (and that there 
is a modest tendency for an intermediate planet to have ei- 
ther a small size or a relatively large mutual inclination of 
its orbit). 

If it turns out that there are no such additional planets, 
then one must explain why this excess appears in the sys- 
tems with only a few planets but not in the high multiplicity 
systems. It may be due to strong, ongoing dynamical inter- 
actions caused by relatively large eccentricities of the planets 
in question. Or, it may be the result of singular events such 
as planet scattering or collisions. Additional analysis and 
more data will help resolve this issue. While these excess 
systems in this neighborhood may not be a large population, 
they may uncover some important insights into the dynami- 
cal histories of planetary systems generally. Such a situation 
would not be the first time that a relatively small population 
of planets on unusual orbits gave such insights (e.g., the hot 
Jupiters). These systems may have taken longer to detect 
than the hot Jupiters, but they may prove equally valuable. 
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